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Centralised Bandwidth Allocation for Multicast Traffic 



Background 

Traffic volume in the Internet is growing exponentially, doubling every 3-6 months. The current 
capacity of IP Routers is insufficient to meet this demand and hence there is a market opportunity for 
products that can route IP traffic at extremely large aggregate bandwidths in the order 10 Gbit/s - 
several Terra bit/s. Such routers are termed "Super Routers". 

Additionally, there is a growing need to support multicast (one to many or many to many) 
communications within the internet or any other IP based network. To support such service an IP router 
must be able to replicate packets and send them to multiple outputs on a per packet basis. In a router 
where bandwidth allocations are strictly controlled (in order to support Quality of Service criteria) it is 
necessary to determine how much bandwidth to allocate to multicast traffic across the core switching 
fabric. 

Invention 

The invention relates to a method of dynamically adjusting the bandwidth, allocated to multicast traffic, 
across an ATM or crossbar like switching fabric that joins several IP packet forwarder functions to 
form a "Super Router" node. 

Such a scheme is presented in the following description. In order to prevent head of line blocking 
unicast traffic is queued in separate logical scheduling entities (called scheduler blocks) according to 
which egress forwarder it is destined. The scheduler block serves a set of queues (per class or per 
connection) via any mechanism desired (e.g. strict priority or Weighted Fair Queuing) provided that the 
real time IP traffic class is guaranteed a rninimum bandwidth. 

However, for multicast traffic it is not practical to queue traffic on the basis of a unique combination of 
egress destinations as the number of queues required becomes unmanageable even for a relatively small 
number of egress ports. Hence a separate multicast scheduler block is used in each ingress forwarder 
containing one real time multicast queue and one or more non-real time multicast queues as shown in 
Figure 1. 

This structure of the ingress forwarder is highlighted in Figure 1. Incoming IP traffic from the line is 
queued in the relevant queues associated with a scheduler block. The choice of scheduler block is 
governed by the destination egress forwarder and whether it is multicast or unicast traffic. The class of 
service determines the specific queue to be utilised. 
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Figure 1 Ingress Forwarder Scheduling Function 

The overall view of a centralised bandwidth allocation scheme is shown in Figure 2 below. Every fixed 
period (the Bandwidth Allocation Period - BAP) each ingress forwarder sends buffer occupancy (and 
possibly other information) to the central bandwidth allocation controller. In addition we also require 
each egress forwarder to send information on how many multicast cells were received in the last BAP 
from each ingress forwarder. The bandwidth allocation controller works out the allocation of 
bandwidth between all ingress/egress forwarder pairs for the next BAP and uses this to provide 
scheduling information to the ingress forwarders telling them which cells/packets to transmit in the next 
Ie^II period. 
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Figure 2 



Centralised Bandwidth Allocation 



In order to include multicast functionality into the bandwidth allocation process some additions are 
required to the unicast algorithm defined in the Roke Manor patent 1998P04863 (Centralised 
Bandwidth Allocation Mechanism). The unicast bandwidth allocation algorithm essentially divides the 
available bandwidth at ingress and egress amongst competing forwarders using the ingress queue length 
as a weighting factor. The queue length of the unicast scheduler block for traffic on ingress forwarder . i 
destined for egress forwarder j is denoted by q l} . Thus for example the amount of bandwidth allocated 
to unicast traffic from ingress forwarder i to egress forwarder j (bey) is given by the following equation. 



(1) 



Here ABWj is the available bandwidth (after real time reservations have been accounted for) at the 
egress forwarder j. 



For real time multicast flows, the fan-out and bandwidth guarantees are known in advance and the sum 
of all ingress and egress requirements can be subtracted from the available bandwidth in the same way 
as for real time unicast traffic flows. 

As the amount of egress bandwidth required for non-real time multicast flows is not known (compared 
with the case for real time multicast) it must be learnt by the system. One solution to this problem is to 
collect statistics at the egress forwarders on the number of multicast cells received from each ingress 
forwarder in the last Bandwidth Allocation Period (BAP). These statistics can then be included in the 
queue statistics message sent from the ingress forwarder to the central scheduler every BAP. 

Figure 3 shows the relevant terminology used for calculating non-real time multicast bandwidth 
allocation. The ingress forwarder multicast queue occupancy is denoted as mcq; for ingress forwarder i. 
The number of multicast cells received by egress forwarder j from ingress forwarder i in the last BAP is 
denoted by mcqij. The bandwidth allocated to non-real time multicast flows from ingress forwarder i to 
egress forwarder j is denoted by mcby. 



SPD 



3 



Ingress LfC 



Egress LIC 




Figure 3 



Effective Queue lengths for Non-Real Time Multicast Bandwidth Allocation 



The value of mcqj is used in the ingress bandwidth fair share in the same manner as q^ does in the 
unicast centralised bandwidth allocation algorithm. 

The values mcqy take part in the egress fair share allocation by providing a proportion with which to 
scale the ingress multicast queue occupancies. This means that the effective weight that the occupancy 
of the ingress nrt multicast queue (mcq ; ) has on an egress forwarder j (called emcqjj) is determined by 
the proportion of nrt cells received by egress forwarder j compared to all those sent in the last BAP 
period. It is therefore governed by the following equation: 

\mcq . = mcq { * ™ cq ' lj (2) 

k 

The value of emcqjj will be used in egress bandwidth allocation functions alongside the unicast 
equivalents qjj. 

Thus the equivalent of equation (1) when including the multicast traffic is given in equation (3). 



bme i} = abw j 



emcq.. 



(3) 



Similar principles can be applied at the ingress for bandwidth allocation and any left over bandwidth 
can be distributed between unicast and multicast allocations by any fair mechanism required. 
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